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Description 



[0001] The present invention relates generally to computer controlied printing of documents having text and graphical 
components, and particularly to methods and systems for translating documents, represented in a structured page 
s description language, Into a number of different formats suitable for use with a variety of printing devices and also for 
transmission to other devices such as facsimile transceivers. 



BACKGROUND OF THE INVENTION 
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is 



[0002] Prior to the introduction of laser printers in 1 980, the control commands transmitted by computers to printers 
were so-called escape sequence commands because commands were distinguished from character data by preceding 
each command with a special byte called the escape character. This methodology worked well with daisy wheel and 
dot matrix printers, but was not well suited for printing documents that combined text and graphical images 
[0003] A new type of printer control methodology, using a "Page Description Language" (PDL) was developed to 
control laser printers. Various PDL's were developed in the 1980s, the best known examples being PostScript (a trade- 
mark of Adobe Systems Incorporated) and Interpress, although a number of proprietary PDL's are used by different 
printers. These prior art PDLs introduced many usefui printer control methodologies, including such tools as Resource 
Declarations, Context Declarations, Dictionaries, the use of memory stacks, as well as a large number of predefined 
commands for defining specific graphical image elements, for controlling the contents of the printer controller's memory, 
so and so on. These features of the prior art PDLs are extensively documented in publicly available manuals such as 
Adobe System Incorporated's "PostScript Language Reference Manual" and its "PostScript Language Program De- 
sign", both published Addison -Wesley Publishing Company. Another publication concerning PDL's is "Interpress The 
Source Book" by Steven J. Harrington and Robert R. Buckley, published by Simon & Schuster, Inc. (1 988) A publication 
concerning a proposed standard page description language (SPDL) which organizes documents in a hierarchical man- 
25 ner is "ISO/IEC DIS 101B0, Information Processing - Text Communication - Standard Page Description Language" 
(1 991 ). 

[0004] One shortcoming of PostScript is the fact that the Page Description for a particular document can contain 
new definition, such as a new resource definition (e.g., for an additional font to be used in the document) or a new 
dictionary definition anywhere within the document. As a result, the entire contents of the document must be inspected 
30 m order to determine whether a particular printer has the resources necessary to print a particular document. Alternately, 
it is quite possible for the printing of a document to fail at any point during the printing process due to the inability of 
the printer to comply with the commands of the document's page description. 

[0005] Another problem associated with PostScript is that in order to print a specified page of a document it is 
necessary to read the entire PDL description of all the preceding pages of the document in order to determine the state 
of the documents page setup parameters (i.e., Resource Declarations, Dictionary Definitions, and so on) at the begin- 
ning of the specified page. In other words, the print controller or a print driver program must read the entire PDL 
description of the document to take into account the effect of every page setup command between the beginning of 
the document and the specified page. While this page setup scanning process is straightforward, it is wasteful. Adobe 
System Inc. has proposed programming conventions to avoid or reduce this problem. 

[0006] Interpress uses free formatted prologues which can be used to avoid the above problem. The Standard Page 
Descnption Language proposed by ISO on the other hand uses prologues having a predetermined fixed format. 
[0007] Other shortcomings in the prior art include the failure to provide systems which are designed to translate 
documents between various printer command formats, various page description languages, as well as other types of 
devices such as facsimile machines. 

[0008] An article entitled "Printer controller ICs take on PDL bottlenecks" published in "Computer Design", Vol. 27, 
No. 13, July 1988 at pages 26 to 30 discloses configurations intended to solve the above problems. These'take the 
form of a coprocessor-based raster printer accelerator and a microprocessor-based printer controller chip. 
[0009] The present invention provides an improved image processor, which processes documents represented by 
statements In a structured page description language (such as the Standard Page Description Language proposed by 
ISO), converts documents between a variety of different document description formats, and also transmits documents 
to remote devjces in accordance with the resources available at those remote devices, thereby using the most efficient 
data transmission format which is compatible with the receiving remote device. 

SUMMARY OF THE INVENTION 

[0010] According to the invention, there is provided a document image printing controller as set out in Claim 1 
Preferred features of the invention are set out in Claims 2 to 7. 

[001 1] In summary, the present invention provides a document processing system for controlling the printing of doc- 
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uments represented in page description language form. Documents are represented by a page description language 
which is structured so that definition and declaratory commands are positioned only at the beginning of each distinct 
document segment. More specifically, each document has optional prologue sections, which contain definition and 
declaratory commands, and content portions which contain the specific tokens or commands for defining specific !m- 
5 ages. Furthermore, the definition and declaratory commands in the prologue sections of the document are arranged 
in a hierarchical tree so that each definition and declaratory command has a scope corresponding to the portion of the 
hierarchical tree subtended by that command. 

[0012] The document processing system may include several distinct sets of software for processing different por- 
tions of each document A Structure Processor handles resource declaration and definitions, dictionary generation, 
io context declarations and references to data external to the document. A Content Processor processes the tokens using 
the definitions and declarations set up by the Structure Processor. In addition, an Imaging Driver Module translates 
the document into commands suitable for any of several types of printers, as well as for communication by telephone 
line to a remote device. 

[001 3] An important advantage of the present invention is that any specified portion of a document can be processed 
is or printed without having to process the entire document prior to the specified portion. Only structural definitions in the 
hierarchical tree above the specified document portion need to be processed. This is both efficient, and also facilitates 
determination of the resources needed by the document prior to commencing actual printing of the document. This 
feature is useful not only when printing a document, but also when transmitting a document to a remote device. In 
accordance with the present invention, the document processing system queries the remote device to determine wheth- 
20 er it has the resources required for receiving and processing a PDL, Post Script or HP Laserjet encoded document, 
and then either transmits the document in rasterized form (e.g., to a fax machine or to other "raster output engine" 
devices such as a laser printer connected to the document processing system via a video port) if the required resources 
are not available, or transmits the document in a higher level encoded form if the required resources are available. By 
determining the resources available to the remote device, the most efficient transmission format can be used, thereby 
25 reducing transmission costs. 



BRIEF DESCRIPTION OF THE DRAWINGS 



[0014] Additional objects and features of the invention will be more readily apparent from the following detailed de- 
30 scription and appended claims when taken in conjunction with the drawings, in which: 

[0015] Figures 1 A and 1 B are block diagrams of a document represented in structured PDL form. 

[0016] Figure 2 is a block diagram of the hardware in a document processing system. 

[0017] Figure 3 is a block diagram of the software modules in the preferred embodiment. 

[0018] Figure 4 depicts the primary data structures used by the software modules in the preferred embodiment. 
3S [001 9] Figures 5A and 5B are flow charts of the process of parsing a document by the Lexical Analyzer and Parser 

of the preferred embodiment. 

[0020] Figure 6 is a flow chart of the process of interpreting structural commands by the Structure Processor of the 
preferred embodiment 

[0021] Figure 7 is a block diagram depicting operation of the Content Processor of the preferred embodiment. 
40 [0022] Figure 8 is a block diagram depicting operation of the Imaging Driver Module of the preferred embodiment 
[0023] Figure 9 is a block diagram of the communication processor software of the preferred embodiment for trans- 
mitting documents to remote devices. 
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DESCRIPTION OF THE PREFERRED EMBODIMENT 
Hierarchically Structured Page Description Language 



[0024] Referring to Figures 1 A and 1B, a document 100 used by the present invention is represented by a set of 
page description language (PDL) elements which are divided into page sets, and pictures. Both page sets and pictures 
so can have prologue sections that define structural elements of the document, and these prologue sections are organized 
in a hierarchical fashion so that the declarations and definitions in each prologue are applicable only to the subset of 
the document that is subtended by that prologue in the document's hierarchical structure. 

[0025] A top level page set 1 02 sets up resources, dictionary and external definitions useable by the entire document. 
[0026] For the purposes of this description, resources, dictionary and external definitions perform the same basic 
6S functions as they do in the PostScript page description language. For those not familiar with these terms, the following 
short definitions are provided. Resource declarations and definitions specify fonts, filters, fill patterns, colors, glyphs 
and so on, which are then available to be invoked by tokens in the document Resource declarations bind a name to 
each specified resource, while resource definitions specify the exact nature of each resource. Dictionaries are used 
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to translate key values into specified list of tokens or other values, and thus is similar to a list of macro definitions 
External definitions reference data structures external to a document, one example being the image of a corporate 
logo that Is to be printed at the top or bottom of letters and memoranda 

[0027] In addition to the top level page set 102 are a set of second level of page sets 104, 106 Each page set 104 
has a prologue 110, body 112 and end 114. The page set prologue 110 provides resource definitions 110-A and dec- 
larations 110-B, dictionary generation statements 110-C and external definitions 110-D used by subsections of the 
document. The body 11 2 of a page set consists of one or more pictures 120. A picture 120 corresponds to a contiguous 
segment of a document, such as a page, a portion of a page, or possibly a sequence of several pages. 
[0028] The data structure of each picture 120 comprises an optional prologue 122 followed by a picture body 124 
The picture body 124 can contain one or more sub-pictures 126, as well as a token sequence 128 which defines the 
images In one segment of the document (e.g., the image elements for one page). Picture prologues 122 provide re- 
source declarations and definitions, dictionary generation statements and context definitions used solely by that one 
picture (i.e., that segment of the document). 

[0029] Looking at Figures 1 A-1 B as a whole, the data structure embodying document 1 00 is structured hierarchically 
Prologues, containing definition and declaratory commands, are positioned only at the beginning of each distinct doc- 
ument segment. More specifically, each document has optional prologue sections, which contain definition and declar- 
atory commands, and content portions which contain the specific tokens or commands for defining specific images 
Furthermore, the definition and declaratory commands in the prologue sections of the document are arranged in a 
hierarchical tree so that each definition and declaratory command has a scope corresponding to the portion of the 
hierarchical tree subtended by that command. Thus, for instance, the definition and declaration commands in the 
prologue of Page Set 1 apply only to the pictures in that page set and therefore do not apply to the pictu res in Page Set N 
[0030] A formal definition of the data structure of a document is shown in Table 1 . 



TABLE 1 

?DL Definitions 
document : :« (pageset | picture) 
pageset ::«= (prologue)?, (pageeet_body) ? 

prologue : :« (external_decl* , inf ormative_decl* 

resource_def*, resource_decl*, doc_prod_inst_decl*' 
context_decl*, dict_gen_decl*, set_up_proc*> 

pagesetjbody (pageset | picture) 

picture (prologue)?, <picture_body) ? 

picture_body : :» (picture | tokensequence) * 



All must occur in the order shown. 
One and only one must occur. 
Optional (0 or 1 time) 

Optional and repeatable (0 or more times) 



Image Processor Hardware 

[0031] Referring to Figure 2, the Image Processor 150 of the present invention is preferably implemented as a stand 
alone computer system having a central processing unit (CPU) 1 52 such as the AMD 29000 made by Advanced Micro 
Devices or any of the Motorola 6B000 series microprocessors, and random access memory (RAM) 1 54 for temporary 
data and software storage. ROM (read only memory) 156 and EEPROM (electrically erasable read only memory) 158 
store the system's software modules, which will be described below A clock circuit 1 50 provides required timing signals 
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printer interface(s) 162 couples the Image Processor to various printer devices 164, and host interface 166 couples 
the Image Processor to a host computer 170, such as a Macintosh computer made by Apple Computer, or an IBM PC 
compatible computer. 

[0032] The Image Processor 1 50 can optionally include other features such as display and control panel 172; a hard 
s disk and disk controller 174 for storing items such as documents, font data, and data structures referenced by external 
definition statements; and a communication interface 180 or modem for telecommunication with remote devices such 
as facsimile machines and remotely located computers and printers. 

[0033] The image processor 1 50 of the present invention is presented here as a stand alone device, suitable for use 
as a peripheral device to one or more computers, such as in the print server of a local area network or as a special 
10 printer module incorporated into a computer. However, the image processor 150 may also be used as the engine/ 
controller of a "PDL compatible 0 laser or ink jet printer. 

Image Processor Software Modules 

15 [0034] Referring to Figure 3, the document to be handled by the Image Processor 1 50 is typically generated by an 
application program 190, such as a word processing program, drawing program or document assembly program, ex- 
ecuted by a host computer 170. The application program 1 90 will typically have a print driver sub-program which output 
documents using the structured PDL description described above with reference to Figures 1 A-1 B. The PDL command 
elements can be exported by the application in either binary or clear text form (where binary is a compact machine 

20 language format and clear textte a human readable format). 

[0035] The Image Processor 150 contains several software modules 200-210 which process documents exported 
by the host computer 170. Lexical Analyzer and Parser 200 identifies each separate element of the document, checks 
for errors, disposes of document elements which do not need to be processed, and passes the remaining document 
elements to Structure Processor 202. Structure Processor 202 interprets or decodes structural command elements, 

25 loading appropriate state and image parameter values into the image processor's memory, and passes content ele- 
ments on to the Content Processor 204. The Structure Processor 202 also sends print control commands, such as the 
"page print" command issued after determining that the end of a page has been encountered, directly to the Imaging 
Driver Module 206. In addition, various data structures managed by the Structure Processor 202 are translated into 
the target printer commands, such as PostScript commands, by the Image Driver Module 206. 

30 [0036] If the document is being printed, the Content Processor 204 converts tokens into imaging instructions (some- 
times herein called image data) suitable for the specified target printer. If the document is being transmitted to a remote 
device/the Content Processor 204 will either pass along elements of the document without processing them (e.g., if 
the remote device is a PDL compatible printer), or will convert the tokens into imaging instructions or facsimile data 
suitable for use by the remote device. 

35 [0037] Imaging Driver Module 206 directs the output of the image processor 150 to a specified output device, and 
also handles the task of rasterizing documents being sent to devices that cannot accept higher level printing commands. 
Communication processor 208 coordinates the transmission of documents to and from remote devices. Finally, option 
processor 210 handles document options parameters from the host computer, typically defined by setup procedural 
elements near the beginning of the document. Options parameters control such items as the printer to be used, whether 

ao the document Is to be transmitted onto a telecommunication line, how to handle non-fatal syntax errors in the PDL 
command stream, and so on. 

Image Processor Data Structures 

45 [0038] Referring to Figure 4, the following data structures are used by the image processor to store information 
generated during the printing process. A resources data structure 220 stores definitions of the character fonts and 
other resources defined by the document being processed. The Structure Processor 202 handles the processing of 
storing resource definitions and declarations, which are then used by the Content Processor while processing tokens 
that reference declared resources. 

so [0039] Dictionaries 222 are data structures that store tables of values, as defined by dictionary generating statements 
in the prologue sections of the document. The dictionary stack 224 is an ordered set of pointers to dictionaries in data 
structure 222. Specified dictionaries are pushed onto dictionary stack 224 and popped from the dictionary stack 224 
in accordance with the scope accorded to each dictionary. Referring to Figures 1 A-1 B, a dictionary generated by the 
prologue in the first page set 104 wouid be pushed onto the dictionary stack 224 prior to processing pages of the 

55 document in that page set. That dictionary would be popped off the stack when processing of page set 1 ended and 
processing of another page set began, because the dictionary defined in page set 1 is not applicable to the other page 
sets. 

[0040] Table 2 represents an example of a very short document using a page description language structured in 
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accordance with the present invention. This document defines multiple dictionaries and resets their order within the 
image processor's "dictionary stack", thereby changing the order in which the dictionaries are searched for key values. 



TABLE 2 
Example of Document 

< IDOCTYFZ SPDL PUBLIC •XXX*> 
<SPDL> 

<comeat> 

Test File Reader 

■ 

Copyright 1991 Ricoh Corporation 
All Rights Reserved 
Confidential and Proprietary 

File i ctxtdcll.pro 

Author i Tetsuro Kotoyama 

Version x 0.01 

File Created*: June 5, 1991 
First Draft : June 5, 1991 

Update History: 

Description: 

This is a test file for SPDL syntax checking. Zt creates 
three dictionaries through the Diet. Gen. and manipulates 
the context stack by context declaration* 

Note: Put PSOTDO codes before each SPDL test file. 

- SPDL PSEUDO CODE LISTING - 
SPDL 

Document* picture % three dictionaries are defined 

prologue . - 

Dictionary Generator diet id-alpha size 3 
Dictionary Generator dictid*beta size 4 
Dictionary Generator dictid«gamma size 5 
pictbody 

tokenseg 2 3a ^expect the a in the gamma to be executed 
picture 
prologue 

context deel gamma beta alpha 
pictbody 

tokenseg 2 3a ^expect the a in the alpha to be executed 

< /comment > 

<document> 

<picture spdlid«-SPDL" 
cntnttyp«"SPDLClearText" > 

< comment > spldid and cntnttyp are Public Object ID values 

< /comment > 
<prologue> 

< diet gens > 

<dictgen s iz e « ■ 3 •><<dictid><name> alpha < /name >< /diet id> 
<tokenseq> 

\ The operand stack has the dictionary reference 
dup /a (add) put 
dup /d <div) put 
dup /m {mul} put 
< /tokensegx/dictgen> 



6 



EP 0 538 059 B1 



10 



15 



<dictgan «ix«--4- xdictidxa 
<tokans*o;> 

dup /a (div) put 
dup /d {xaul) put 
dup /a (add) put 
dup /a (sub) put 
</tokansea;> 
< /dictgen> 
KdiCtffttD *is*«"5"> 
<toXen»eq> 

dup /a (xmil) put 
dup /d (add) put 
dup /a (sub) put 
</token*eg> 
<token»«Q> 

dup /■ (div) put 
dup /c (cos) put 
</tohenseg^ 
</dictgen> 
< /diet gens > 



>bata</naa«x/dictld> 



>Q 



</Dft&t></dictid> 



20 



dictionary stack 



25 



30 



beta 
alpha 

< /cPTTTm*mt> 
< /prologue > 

<picbody> 

<tokansecj> 

2 3a 

</token»«g> 
<picture> 

<prologue> 

<ctxtdecl> 



Vexpect the result to be 6 
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<naae > gamma < /same > <n ana >be t a < /name xname > alpha </ n una > 

</ctxtdeel> 
</proiogue> 
<cosfi&ent> 

dictionary stack 
alpha 
beta 
gamma 
< /comment > 
<picbody> 

<tokenseq> 

2 3a ^expect the result to be 5 
</tokenseq> 
</picbody> 
</picture> 
</picbody> 

< /picture > <comment> printing the first page </ comment > 
< /document > 
</SPDIi> 



[0041] The Structure Processor 202 allocates memory for storing dictionaries, but passes dictionary generation com- 
mands to the Content Processor for creating the specific entries in each defined dictionary. The Structure Processor 
202 also pushes and pops dictionary pointers onto and off of the dictionary stack 224 so as to control the scope of 
each dictionary. The Content Processor 204 searches the dictionaries currently in the dictionary stack 224, starting 
55 with the last added dictionary, for "key" values in tokens. If two dictionaries in the stack 224 having conflicting definitions 
for a particular key, the last entered definition for a specified key is the one that is used. 

[0042] Operand stack 226 is a standard operand stack used for temporarily storing parameter values to be used by 
imaging operators (called tokens) in the content portion of the document being processed. 
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[0043] State parameters 228 are a set ot variables or parameters which control the processing of a document. State 
parameters include things such as a "fine joining method parameter" which defines the shaping of joints between 
intersecting or otherwise joined line segments, and a "specified printer" parameter defining the printer port to which 
the document is being sent. State parameters also include imaging parameters such as parameters which define the 
color being printed, the margins of the page being printed, the current clipping region, and the current drawing position 
(e.g., for the next letter of text to be printed). 

[0044] The External ID 230 is a value corresponding to a specified external file, such as a file containing the PDL 
program for a company logo. 

[0045] Document Production Instructions 232 define things such as the number of copies of each page that are to 
be printed, and the paper color to be used (n.b., a paper color instruction Is conveyed by the system to an operator 
console requesting that the specified color paper be loaded into the printer's paper feeder). 

Lexical Analyzer and Parser Process 

[0046] As shown in Figure 3, the first software module to receive data representing the document to be processed 
is the Lexical Analyzer and Parser 200. Referring to Figures 5A and 5B, the Lexical Analyzer and Parser works as 
follows. 

[0047] Referring to Figure 5A, prior to performing any other processing on each document, the Lexical Analyzer and 
Parser reads the first nine characters of the document, converts them to upper case form, and compares that string 
with "<!DOCTYPE n . If there is a match, the Lexical Analyzer and Parser concludes that document is encoded in clear 
text form and sets a corresponding "document type" flag, otherwise ft concludes that the document is encoded in binary 
form and sets the document type flag accordingly. There is a one-to-one correspondence between clear text and binary 
formats. Binary format is more compact, while clear text is easier for humans to read. Whether a document is received 
in either binary or clear text form is of no consequence, except for steps 275-280 described below which provide for 
conversion of documents in one form to the other. After performing this initial data type check, the steps shown in 
Figure 5Aare not used until the beginning of another document is received. 

[0048] Referring to Figure 5B, at step 252 the lexical analyzer and parser routine checks to see if the end of the 
document has been reached. If so, memory allocated to the printing process is de-allocated, and the lexical analyzer 
and parser routine exits at step 254. 

[0049] If the document's end has not been reached, then next element of the document is read in at step 256. The 
element may be either a structural or content element. If the element contains a fatal error (step 258), a corresponding 
error message is printed (step 260) and the program exits, thereby terminating processing of the document. 
[0050] Assuming that a fatal error was not encountered at step 258, processing of the element depends on whether 
the element contains parameters (step 262). Some elements are simple statements or commands that do not have 
parameters, while many document elements contain parameters. If the element does not contain any parameters, 
processing jumps to step 274, as will be described below. 

[0051] 11 the element being processed contains parameters, the parameter values are read at step 262, and then 
tested for errors at step 266. If a parameter error is detected (indicating that illegal or inconsistent parameters values 
were received), processing of the error depends on whether the image processor has been instructed to "struggle" - 
i.e., to attempt to process documents containing non-fatal errors. If the image processor has been instructed to "strug- 
gle" (step 268), the element with the error is discarded by not processing it further and allowing processing to continue 
with the next element in the document. Otherwise, if "struggling" has not been enabled, the image processor's "syntax 
checking mode" flag (one of the image processor's state parameters) is set to syntax checking mode at step 270, 
meaning that nothing further in the document will be printed, and then an error message is printed at step 272 before 
processing resumes with the next element in the document. 

[0052] If no errors in the element's parameters were detected (step 266), the "syntax checking mode" state parameter 
is checked at step 274. If it is set, only syntax checking is being performed and therefore the element is not processed 
further. Otherwise, the next step is to check to see if the "data conversion" option flag has been set (step 275), which 
is a state variable indicating the document is being processed solely for the purpose of converting the document from 
binary to clear text format, or vice versa. If the document is being converted, and the document is currently in binary 
form (step 276), a routine is called for converting the element being processed into clear text form (step 278). On the 
other hand, if the document is currently in clear text form (step 276), a routine is called for converting the element being 
processed into binary form (step 280). 

[0053] If the data conversion option has not been selected, the element's parameters are converted and stored in a 
predefined internal format for later processing at step 282, and then the Structure Processor is called to process the 
current element at step 284. More specifically, the Structure Processor contains routines for processing every type of 
legal element allowed in the Page Description Language, and the appropriate one of these routines is called at step 284. 
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Document Structure Processor 

[0054] Referring to Figure 6, the Structure Processor 202 contains the following software modules whose functions 
are briefly described by the text in Figure 6: document structure manager 300, external description manager 302, 

5 Information declaration manager 304, resource definition manager 306, resource declaration manager 308, document 
production Instruction manager 310, dictionary stack manager 31 2, dictionary generator manager 314, setup procedure 
manager 316, token sequence manager 318, and structure error handler 320. These software modules decode cor- 
responding elements in each prologue section of the document, and store representations of the resulting "printing 
command interpretation environment" as state parameters. 

10 [0055] Token elements are passed by the Structure Processor to the Content Processor by token sequence manager 
318. 

[0056] Document production instructions, such as an instruction to print only pages 7 through 10 of a document are 
handled initially by the document production instruction manager 310 so as to store the appropriate production control 
values in data structure 232 (see Figure 4). Thereafter, the document structure manager 300 uses the stored document 
is production control values to skip over or discard sections of the document corresponding to unselected portions of the 
document, and to push and pop dictionaries onto the dictionary stack so as to provide the appropriate dictionaries for 
each section of the document that is selected for printing. 

Document Content Processor 

20 

[0057] Referring to Figure 7, the token sequence manager 204 has a token manager 350 which receives each token 
element to be processed. There are three special cases in which the token element is not interpreted by the Content 
Processor 204. If the document is not being printed, but instead is being transmitted to a remote device capable of 
processing PDL documents, the token manager 350 passes the element unchanged to the Imaging Driver Module. 

25 Similarly, If the document is being printed by a PDL capable printer (i.e., a printer which can interpret PDL documents), 
the content tokens are passed unchanged by the token handler to the imaging Driver Module. Finally, if the document 
is being printed by a PostScript compatible printer (i.e., a printer which can interpret PostScript language documents), 
the content tokens are converted into equivalent PostScript tokens (using a simple table look-up conversion method- 
ology) by the Token Handler 350 and then passed to the Imaging Driver Module. 

30 [0058] Assuming that none of the exceptional cases just described above applies, and that therefore the token ele- 
ment received needs to be interpreted and converted into image data, the required content processing proceeds as 
follows. Operands for the element are pushed onto the operand stack by an operand stack handler 352, and then the 
operator portion of the element is "executed 0 or interpreted by operator execution controller 354. While interpreting 
token elements, the operator execution controller 354 uses the parameters previously pushed onto the stack and 

35 generates imaging parameter values, represent portions of the document being processed. The imaging parameter 
values are passed to the Imaging Driver Module. 

[0059] A dictionary handler 356 is called by the token handler to convert parameter keys into parameter strings, and 
is called by the operator execution controller 354 to convert operator keys into corresponding strings of operator, and 
sometimes parameter, values. Finally, errors such as stack underflows and overflows, and references to undefined 
40 keys or to a nonexistent dictionary are handled by a content error handler 358. 

Imaging Driver Module 

[0060] Referring to Figure 8, the Imaging Driver Module 206 has an option handler that determines which of several 
45 imaging driver programs 372-378 are to be used to process the document Driver programs are provided for PostScript 
printers, HP Laserjet and HP Laserjet emulation printers, bit map printers, and for sending documents to remote de- 
vices. A bit map printer is one does not accept high level commands and therefore must be sent an image in the form 
of a rasterized bit map. A rasterizer routine 380 converts each page of the document into a bit mapped image, which 
can then be transmitted to a device such as a simple dot matrix or ink jet printer that does not have a built-in PDL or 
so PostScript printer. The rasterizer routine 380 is also used when printing to an HP Laserjet or compatible printer for 
converting all portions of the document, excepting those text portions for which the printer has corresponding built-in 
or downloaded fonts, into bit map form. 

[0061] The Imaging Driver Module 206 runs in parallel with the other modules of the image processor. As imaging 
parameters are written into data structure 228, the Imaging Driver Module 206 picks them up and processes them, 
ss either building up bit map images until a page is ready to be printed, or printing individual elements as they arrive. 



9 



EP 0 538 059 B1 



Communication Processor 

[0062] Referring to Figure 9, the communication processor 208 has the ability to either send a document in facsimile 
form to a remote facsimile machine, or to send a document in PDL format to a remote device that has the resources 
required for processing the document in that format. The communication processor 208 Inspects the resources data 
structure 220 and builds its own resource requirement table 390, which is a listing of the resources, including memory 
fonts, and so on, for processing the specified document. Then an inquiry protocol module 992 established a commu- 
nication connection to a specified remote device, and queries that device to determine if it has ail the resources listed 
in table 390. For instance, this could be done using the protocol for communication between facsimile machines which 
allows for special information to be passed between compatible machines. 

[0063] If the remote device answers these queries with an affirmative answer, then module 394 is called for trans- 
mitting the document to the remote device in PDL document description format. PDL format is not only much more 
compact and therefore more efficient than sending a document in binary map form, It also provides much more infor- 
mation to the receiving device, allowing for instance the document to be easily edited and revised. However, if the 
remote device does not contain the required resources, the document will be transmitted to the remote device by 
module 396 in bit map form, typically using industry standard facsimile G3 or G4 transmission methodology. 



Claims 

1 . A document image printing controller (150), comprising: 

means (180) for receiving a document (100) represented by a stream of page description language elements 
which define said document (100) as a hierarchical tree structure, said received document (100) including a 
hierarchically ordered set of prologue sections (110,122), each prologue section containing respective pro- 
logue elements (11 0-A, 1 1 0-D) and subtending at least one content section (1 28) which contain image defining 
token elements; wherein said prologue elements (110-A, 110-D) in each said prologue section (110, 122) are 
applicable only to those of said content sections (128) which are subtended by said prologue section (110, 
122) in the hierarchical tree structure; 

a document structure processor (202) which processes said prologue elements (110-A, 110-D), and stores 
representations thereof in a computer memory so that stored representations of only those prologue elements 
(110-A, 110-D) applicable to each content section (128) of said document (100) are available to said content 
processor (204) while processing each said content section (128) of said document (100); 
a content processor (204) for processing said content sections (128) of said received document (100) and 
generating corresponding imaging instructions for a predefined printer device;and 

an imaging driver processor (206) for formatting and transmitting said imaging instructions to a printer port 
corresponding to said predefined printer device. 

2. A document image printing controller according to claim 1 , wherein said imaging driver processor (206) includes 
a first printer driver (372, 374) for transmitting said imaging instructions to a first printer port using high level page 
description language commands. 

3. A document image printing controller according to claim 1 or 2, wherein said imaging driver processor includes a 
second printer driver (376) and a rasterizer (380) for converting said imaging instructions into an image bit map 
and then transmitting said image bit map to a second printer port. 

4. The document image printing controller of claim 3, wherein said second printer port is a communication port for 
sending images to a remotely located facsimile machine and said image bit map data is transmitted using a pre- 
defined facsimile transmission protocol. 

5. A document image printing controller according to claim 1 , wherein said imaging driver processor (206) includes 
a first printer driver (372, 374) for transmitting said imaging instructions to a first printer port using high level page 
description language commands, and a second printer driver (376) and a rasterizer (380) for converting said im- 
aging instructions into an image bit map and then transmitting said image bit map to via a communication port to 
a remotely located printing device; 

said document image printing controller including means (220) for generating a required resource list repre- 
senting a set of resources which would be required for a remotely located printing device to print said received 
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document (100) were said received document (100) to be transmitted to said remotely located printing device 
using high level page description language commands; 

said second printer driver (376) including protocol means for querying said remotely located printing device 
to determine whether said remotely located printing device has resources corresponding to those represented 
5 by said required resource list, means for transmitting said imaging instructions using high level page description 

language commands when said remotely located printing device responds affirmatively to said querying by 
said protocol means, and means for transmitting said imaging instructions as an image bit map when said 
remotely located printing device does not respond affirmatively to said querying by said protocol means. 

io 6. A document image printing controller as set out in claim 1 , wherein said predefined printer device is configured to 
receive documents in PostScript language form; 

said printer imaging driver processor (206) includes structure converting means for converting said stored 
representations of said prologue elements into corresponding PostScript commands; 
is said content processor (204) includes content converting means for converting said content sections of said 

received document into corresponding PostScript commands; and 

said printer imaging driver processor (206) includes means for transmitting said corresponding PostScript 
commands to said predefined printer device. 

20 7. a document image printing controller according to claim 1 , wherein said predefined printer device is configured to 
receive documents in PostScript language form; 

said document image printing controller including: structure converting means (202) for converting said stored 
representations of said prologue elements into corresponding PostScript commands, and content converting 
25 means (204) for converting said content sections of said received document into corresponding PostScript 

commands; and 

said printer imaging driver processor including means for transmitting said corresponding PostScript com- 
mands to said predefined printer device. 

30 

PatentansprQche 

1. Dokumentenbild-Druckcontrolier (150) mit: 

einer Einrichtung (180) zum Aufnehmen eines Dokuments (100), welches dargestellt ist durch einen Strom 
von Seitenbeschreibungs-Sprachelementen, welche das Dokument (100) als eine hierarchische Baumstruktur 
definieren, wobei das aufgenommene Dokument (100) einen hierarchisch geordneten Satz yon Prologab- 
schnitten (110, 122) enthalt, wobei jeder PrologabschnittjeweiligePrologeiemente (110-A, 110-D) enthalt und 
wobei zumindest ein Inhaltsabschnitt (128) enthalten ist, welcher bilddefinierende Token-Elemente enthalt; 
wobei die Prologelemente (110-A, 110-D) in jedem Prologabschnitt (110, 122) nur auf die Inhaltsabschnltte 
(128) anwendbar sind, welche von dem Prologabschnitt (110, 122) in der hierarchisch en Baumstruktur unter- 
geordnet sind; 

einem Dokumentenstrukturprozessor (202), welcher die Prologelemente (110-A, 110-D) verarbeitet und Dar- 
steilungen davon in einem Computerspeicher so speichert, daB die gespeicherte Darsteilungen von nur den 
Prologelementen (110-A, 110-D), welche auf die jeweiligen Inhattsabschnitte (128) des Dokuments (100) an- 
wendbar sind, fur den Inhaltsprozessor (204) verf Ogbar sind, wahrend er den jeweiligen Inhaltsabschnitt (1 28) 
des Dokuments (100) verarbeitet; 

einem Inhaltsprozessor (204) zum Verarbeiten der Inhaltsabschnltte (128) des aufgenommenen Dokuments 
(100) und zum Erzeugen entsprechender Bildanweisungen f Or eine vordefinierte Druckervorrichtung; und 
einem Bildtreiberprozessor (206) zum Formatieren und Obertragen der Bildinstruktionen an einen Druckerport 
entsprechend der vordefinierten Druckervorrichtung. 

2. Dokumentenbild-Druckcontrolier nach Anspruch 1 , wobei der Bildtreiberprozessor (206) einen ersten Druckertrei- 
ber (372, 374) zum Obertragen der Bildanweisungen an einen ersten Druckerport unter Verwendung von Hoch- 

55 pegel-Seitenbeschreibungs-Sprachbefehlen aufweist. 

3. Dokumentenbild-Druckcontrolier nach Anspruch 1 oder 2, wobei der Bildtreiberprozessor einen zweiten Drucker- 
treiber (376) und eine Rasterungseinrichtung (380) zum Umwandeln der Bildanweisungen in einen Bildbitplan und 
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darauffolgendes Qbertragen des Bildbitplans an einen zweiten Druckerport aufweist 

4. Dokumentenbild-Druckcontroller nach Anspruch 3, wobei der zweite Druckerport ein Kommunikationsport zum 
Senden von Blldern an eine entfernt gelegene Facsimile-Maschine ist und die Bildbitplandaten unter Verwendung 
eines vordef inierten Facsimile-Qbertragungsprotokolls Qbertragen werden. 

5. Dokumentenbild-Druckcontroller nach Anspruch 1 , wobei der Bildtreiberprozessor (206) einen ersten Druckertrei- 
ber (372, 374) zum Qbertragen der Bildanweisungen an einen ersten Druckerport unter Verwendung von Hoch- 
pegel-SeHenbeschreibungs-Sprachbefehlen und einen zweiten Druckertreiber (376) und eine Rasterungseinrich- 
tung (380) zum Umwandeln der Bildanweisungen in einen Bildbitpian und darauffolgendes Qbertragen des Bild- 
bitplans uber einen Kommunikationsport an eine entfernt gelegene Druckervorrichtung aufweist; 

wobei der Dokumentenbild-Druckcontroller eine Einrichtung (220) zum Erzeugen einer erforderlichen Res- 
sourcenliste zum Darstelfen eines Ressourcensatzes aufweist, welcher erforderlich ware f Or eine entfernt ge- 
legene Druckvorrichtung zum Drucken des empfangenen Dokuments (100), wobei das empfangene Doku- 
ment (100) an die entfernt gelegene Druckvorrichtung unter Verwendung von Hochpegel-Seitenbeschrei- 
bungs-Sprachbefehfen zu Qbertragen ware; 

wobei der zweite Druckertreiber (376) eine Protokoiieinrichtung zum Befragen der entfernt gelegenen Druck- 
vorrichtung aufweist, urn zu bestimmen, ob die entfernt gelegene Druckvorrichtung Ressourcen aufweist ent- 
sprechend jenen, welche durch die erforderliche Ressourcenliste dargestellt sind, einer Einrichtung zum Qber- 
tragen der Bildanweisungen unter Verwendung von Hochpegel-Seitenbeschreibungs-Sprachbefehlen, wenn 
die entfernt gelegene Druckvorrichtung bestatigend auf die Anfrage durch die Protokoll einrichtung antwortet, 
und einer Einrichtung zum Qbertragen der Bildanweisungen als ein Bildbitpian, wenn die entfernt gelegene 
Druckvorrichtung nicht bestatigend auf die Anfrage durch die Protokoiieinrichtung antwortet. 

6. Dokumentenbild-Druckcontroller nach Anspruch 1 , wobei die vordefinierter Druckervorrichtung derart konfiguriert 
ist, daB sie Dokumente in PostScript-Sprachform empf angt; 

der Druckerbildtreiber-Prozessor (206) eine Strukturumwandlungseinrichtung zum Umwandeln der gespei- 
cherten Darsteilungen der Prologelemente in entsprechende PostScript-Befehle aufweist; 
der Inhaltsprozessor (204) eine Inhaltsumwandiungseinrichtung zum Umwandeln der Inhaltsabschnitte des 
empfangenen Dokuments in entsprechende PostScript-Befehle aufweist; und 

der Druckerbildtreiber-Prozessor (206) eine Einrichtung zum Qbertragen der entsprechenden PostScript-Be- 
fehle an die vordefinierte Druckervorrichtung aufweist. 

7. Dokumentenbild-Druckcontroller nach Anspruch 1, wobei die vordefinierte Druckervorrichtung derart konfiguriert 
ist, daS sie Dokumente in PostScript-Sprachform empf angt; 

wobei der Dokumentenbild-Druckcontroller aufweist: eine Strukturumwandlungseinrichtung (202) zum Um- 
wandeln der gespeicherten Darsteilungen der Prologelemente in entsprechende PostScript-Befehle und eine 
Inhaltsumwandiungseinrichtung (204) zum Umwandeln der Inhaltsabschnitte des empfangenen Dokuments 
in entsprechende PostScript-Befehle; und 

wobei der Druckerbildtreiber-Prozessor eine Einrichtung zum Qbertragen der entsprechenden PostScript-Be- 
fehle an die vordefinierte Druckervorrichtung aufweist. 



Revendications 

1. Dispositif (160) de commands d'impression d'images de document, comprenant: 

un moyen (180) destine a recevoir un document (100) represents par un train d'elements de langage de 
description de page qui definissent ledit document (100) sous la forme d'une structure arborescente hierar- 
chisee, ledit document recu (100) comportant un ensemble, hierarchiquement ordonne, de sections de pro- 
logue (110, 122), chaque section de prologue contenant des elements de prologue respectifs (110-A, 110-D) 
et sous-tendant au moins une section de contenu (128) qui contient des elements de signes definissant des 
images ; oil lesdits elements de prologue (110-A, 110-D) de chaque dite section de prologue (110, 122) ne 
sont applicables qu'a celles desdites sections de contenu (128) qui sont sous-tendues par ladite section de 
prologue (11 0, 122) dans la structure arborescente hierarchisee ; 
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un processeur (202) de structure de document qui traite lesdits elements de prologue (11 0-A, 110-D), et stocke 
des representations de ceux-ci dans une memoire d'ordinateur de facon que des representations stockees 
des eeuls elements de prologue (110-A, 110-D) qui sont applicables a chaque section de contenu (128) dudit 
document (100) soient disponibles audit processeur de contenu (204) pendant le traitement de chaque dite 
section de contenu (128) dudit document (100); 

un processeur de contenu (204) servant a traite r lesdites sections de contenu (128) dudit document recu (1 00) 
et a produire des instructions de formation d'image correspondantes destinees a un dispositif d'impression 
pr6d6fini ; et 

un processeur (206) de commande de formation d'image servant a formater lesdites instructions de formation 
d'image et a les transmettre a un port d'impnmante correspondant audit dispositif d'impression predeflnl 

Dispositif de commande d'impression d'images de document selon ia revendication 1 , oil [edit processeur de 
commande deformation d'image (206) comporte un premier gestionnaire d'impnmante (372, 374) servant a trans- 
mettre lesdites instructions de formation d'image a un premier port d'imprimante a I'aide d* instructions de langage 
de description de page de niveau superieur. 

Dispositif de commande d'impression d'images de document selon la revendication 1 ou 2, ou (edit processeur 
de commande de formation d'image comporte un deuxieme gestionnaire d'imprimante (376) et un moyen de mise 
sous forme tramee (380) servant a convertir lesdites instructions de formation d'image en une grille de points 
d'image, ou image matricielle, puis a transmettre ladite grille de points d'image a un deuxieme port d'imprimante. 

Dispositif de commande d'impression d'images de document selon la revendication 3, ou le deuxieme port d'im- 
primante est un port de communication servant a envoyer des images a un telecopieur place a distance et les 
donnees de ladite grille de points d'image sont transmises a I'aide d'un protocols de transmission de telecopie 
predeflnl. 

Dispositif de commande d'impression d'images de document selon la revendication 1 , ou ledit processeur de 
commande de formation d'image (206) comporte un premier gestionnaire d'imprimante (372, 374) servant a trans- 
mettre lesdites instructions de formation d'image a un premier port d'imprimanle a I'aide d 1 instructions de langage 
de description de page de niveau superieur, et un deuxieme gestionnaire d'imprimante (376) et un moyen de mise 
sous forme tramee (380) servant a convertir lesdites instructions de formation d'image en une grille de points 
d'image, ou image matricielie, puis a transmettre ladite grille de points d'image via un port de communication a un 
dispositif d'impression plac6 a distance ; 

ledit dispositif de commande d'impression d'images de document comportant un moyen (220) servant a pro- 
duire une liste de ressources demandees representant un ensemble de ressources qui est demande pour 
qu'un dispositif d'impression place a distance imprime ledit document recu (100) si ledit document recu (100) 
doit etre transmrs audit dispositif d'impression place a distance a I'aide d'inst ructions de langage de description 
de page de niveau superieur ; 

ledit deuxieme gestionnaire d'imprimante (376) comportant un moyen du type protocole servant a interroger 
ledit dispositif d'impression place a distance pour determiner si ledit dispositif d'impression place a distance 
possede des ressources correspondant a celles representees par ladite liste de ressources demandees, un 
moyen servant a transmettre lesdites instructions de formation d'image a I'aide destructions de langage de 
description de page de niveau superieur lorsque ledit dispositif d'impression place a distance repond affirma- 
tivement a ladite interrogation faite par (edit moyen du type protocole, et un moyen servant a transmettre 
lesdites instructions de formation d'image sous la forme d'une grille de points d'image, ou image matricielle, 
lorsque ledit dispositif d'impression place a distance ne repond pas affirmativement a ladite interrogation faite 
ledit moyen du type protocole. 

Dispositif de commande d'impression d'images de document selon la revendication 1 , ou ledit dispositif d'impres- 
sion predefini est configure de facon a recevoir des documents sous forme de langage PostScript ; 

ledit processeur de commande de formation d'image (206) d'imprimante comporte un moyen de conversion 
de structure servant a convertir les representations stockees desdits elements de prologue en instructions 
PostScript correspondantes ; 

ledit processeur de contenu (204) comporte un moyen de conversion de contenu servant a convertir lesdites 

sections de contenu dudit document recu en instructions PostScript correspondantes ; et 

ledrt processeur de commande de formation d'image (206) d'imprimante comporte un moyen servant a trans- 
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metlre lesdites instructions PostScript correspondantes audit dispositif compression pr6d§fini. 

Dispositif de commande depression d'images de document selon la revendication 1 , oil ledit dispositif d'impres- 
sion predefini est configure pour recevoir des documents sous forme de langage PostScript ; 

iedit dispositif de commande depression d'image de document component : un moyen (202) de convereion 
de structure servant a convertir lesdites representations stockees desdits elements de prologue en instructions 
PostScript correspondantes et un moyen (204) de conversion de conten u servant a convertir iesdites sections 
de contenu dudit document regu en instructions PostScript correspondantes ; et 

ledit processeur de commande de formation d'image d'imprimante comportant'un moyen qui sert a transmettre 
lesdites instructions PostScript correspondantes audit dispositif depression predSfini, 
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Passing Data to Content Processor. 



Structure Error Handler 
" Controls Error Handling 
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